An Open Source Punjabi Resource Grammar
Abstract
We describe an open-source computational grammar for Punjabi, a resource-poor language. The grammar is developed in GF (Grammatical Framework), a grammar formalism designed for multilingual grammars. We first examine the syntactic features of Punjabi and then implement them according to the requirements of the GF resource grammar library, making Punjabi the 17th language in that library.
Similar resources
Developing Punjabi Morphology, Corpus and Lexicon
We describe an implementation of morphology, the development of a corpus, and the building of a lexicon for the Punjabi language. Such resources are building blocks for language technology tasks ranging from part-of-speech tagging to machine translation. Their importance is heightened by the fact that Punjabi is an under-resourced language. We release these resources as open source.
An Improved System for Converting Text into Speech for Punjabi Language using eSpeak
A large number of text-to-speech (TTS) software packages are available for speech synthesis, but providing a single generalized system for many languages is a challenging task. eSpeak supports several languages, including Punjabi. It is an open-source application that provides rules and phoneme files for more than 30 languages. This paper discusses some improvements in this formant based...
The Spanish Resource Grammar: Pre-processing Strategy and Lexical Acquisition
This paper describes work on the development of an open-source HPSG grammar for Spanish implemented within the LKB system. Following a brief description of the main features of the grammar, we present our approach for pre-processing and ongoing research on automatic lexical acquisition.
Development of the Korean Resource Grammar: Towards Grammar Customization
The Korean Resource Grammar (KRG) is a computational open-source grammar of Korean (Kim and Yang, 2003) that has been constructed within the DELPH-IN consortium since 2003. This paper reports the second phase of the KRG development, which moves from a phenomena-based approach to grammar customization using the LinGO Grammar Matrix. This new phase of development not only improves the parsing effici...
Lexical Stress in Punjabi and Its Representation in PLS
Punjabi is a tonal language and belongs to the Indo-Aryan family of languages. Punjabi literature reveals that suprasegmental phonemes such as tone, nasalization and stress are realized at the syllable level. There is an abundance of geminated words in which stress co-occurs on the geminated consonant. Disyllabic words have the highest frequency of occurrence. There are very few quadrisyllabic/pol...
Publication date: 2011